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1 The Query Language to XML Documents Connected by XLink Links H 
D. A. Lizorkin 

May 2005 Programming and Computing Software, volume 31 issue 3 
Publisher: Plenum Press 

Additional Information: full citation , abstract , index terms 

The XML linking language (XLink) is a language for describing links between resources by 
means of XML attributes and a special namespace. The specification of the XLink language 
developed by the World Wide Web consortium provides only data structures for describing 
links and a minimal model of their behavior. In this paper, a language that allows the 
application to transparently formulate queries to XLink links and traverse the arcs 
determined by these links is suggested. The suggested langua ... 



2 Information retrieval session 7: web: Combining link-based and content-based 
methods for web document classification 

Pavel Calado, Marco Cristo, Edleno Moura, Nivio Ziviani, Berthier Ribeiro-Neto, Marcos Andre 
Gongalves 

November 2003 Proceedings of the twelfth international conference on Information 
and knowledge management 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: pdf(206.14 KB) 



This paper studies how link information can be used to improve classification results for 
Web collections. We evaluate four different measures of subject similarity, derived from 
the Web link structure, and determine how accurate they are in predicting document 
categories. Using a Bayesian network model, we combine these measures with the results 
obtained by traditional content-based classifiers. Experiments on a Web directory show 
that best results are achieved when links from pages outside the ... 



Keywords: Bayesian networks, classification, link analysis, web 



THESUS: Organizing Web document collections based on link semantics 

Maria Halkidi, Benjamin Nguyen, Iraklis Varlamis, Michalis Vazirgiannis 

November 2003 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 12 Issue 4 

Publisher: Springer-Verlag New York, Inc. 
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Full text available: |%] pdf(262.85 KB) Additional Information: full citation , abstract , citings , index terms 

The requirements for effective search and management of the WWW are stronger than 
ever. Currently Web documents are classified based on their content not taking into 
account the fact that these documents are connected to each other by links. We claim 
that a page's classification is enriched by the detection of its incoming links' semantics. 
This would enable effective browsing and enhance the validity of search results in the 
WWW context. Another aspect that is underaddressed and str ... 

Keywords: Document clustering, Link analysis, Link management, Semantics, Similarity 
measure, World Wide Web 



Link aggregation: Untangling compound documents on the web 
Nadav Eiron, Kevin S. McCurley 

August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 
hypermedia 

Publisher: ACM Press 

Full text available- Wl odf(192 59 KB) Addit,onal Information: full citation , abstract , references , citings , index 
, ^ fi - J ! terms 

Most text analysis is designed to deal with the concept of a "document", namely a 
cohesive presentation of thought on a unifying subject. By contrast, individual nodes on 
the World Wide Web tend to have a much smaller granularity than text documents. We 
claim that the notions of "document" and "web node" are not synonymous, and that 
authors often tend to deploy documents as collections of URLs, which we call "compound 
documents". In this paper we present new techniques for identifying and workin ... 

Keywords: composites, hypertext, information retrieval, semantic web, wasted space 



5 Technical papers: design recovery and documentation: Recovering documentation- 
to-source-code traceabilitv links using latent semantic indexing 

Andrian Marcus, Jonathan I. Maletic 

May 2003 Proceedings of the 25th International Conference on Software 
Engineering 

Publisher: IEEE Computer Society 

Full text available: ^ p df(1 15 mb)^ I Additional Information: full citation , abstract , references , citings , index 
Publisher Site 

An information retrieval technique, latent semantic indexing, is used to automatically 
identify traceability links from system documentation to program source code. The results 
of two experiments to identify links in existing software systems (i.e., the LEDA library, 
and Albergate) are presented. These results are compared with other similar type 
experimental results of traceability link identification using different types of information 
retrieval techniques. The method presented proves to give ... 

6 Linking documents: XLinkProxy: external linkbases with XLink 
Paolo Ciancarini, Federico Folli, Davide Rossi, Fabio Vitali 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 
Publisher: ACM Press 

Full text available' pdf(282 33 KB) Additional Information: full citation , abstract , references , citings , index 
' k^—* ' terms 

In the linking model of the World Wide Web each link is stored in the referring document 
within an attribute of the A tag. All the hyperlink defined this way can reference a single 
resource or a single fragment. With the evolution of Web technologies more powerful 
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linking languages (XLink and XPointer) have been proposed. Here we introduce 
XLinkProxy, a Web application that allows sophisticated hyperlink (defined using XLink 
and XPointer) to be defined outside referring documents, giving users th ... 

Keywords: XLink, XPointer, external linkbases 



7 Linking documents: An open linking service supporting the authoring of web 
documents 



Renato Bulcao Neto, Claudia Akemi Izeki, Maria da Graga Pimentel, Renata Pontin Fortes, 
Khai Nhut Truong 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 
Publisher: ACM Press 

Full text available - W\ odf(447 35 KB) Additional Information: full citation , abstract , references , citings , index 

: terms 

Both content driven web authors and application designers may have their attention 
deviated from their main task when they have to be concerned with the generation of 
elaborated linking structures. This work aims to demonstrate how a metadata-enhanced 
web-based open linking service can be exploited towards supporting content driven 
authors in their tasks. The following results are presented in this paper: (a) the Web 
Linking Service (WLS), a novel open hypermedia system that stores and exchanges ... 

Keywords: RDF-based metadata, content driven authoring, document engineering, open 
linking service, web engineering 



8 Linking documents: XConnector: extending XLink to provide multimedia 
^ synchronization 

^ Debora C. Muchaluat-Saade, Rogerio F. Rodrigues, Luiz Fernando G. Soares 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 

Publisher: ACM Press 

Full text available* f ?\ pdf(267 32 KB) Additiona l Information: full citation , abstract , references , citings , index 

! terms 

This paper proposes XConnector, a language for the creation of complex hypermedia 
relations with causal or constraint semantics. XConnector allows the definition of relations 
independently of which resources are related. Another feature is the specification of 
relation libraries, providing reuse in relationship definition. The main goal is to improve 
linking languages or the linking modules of hypermedia authoring languages in order to 
provide multimedia synchronization capabilities using links. ... 

Keywords: XConnector, XLink, hypermedia connector, links, multimedia synchronization 



Links for a better web: Enhanced web document summarization using hyperlinks 
j.-Y. Delort, B. Bouchon-Meunier, M. Rifqi 

August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 

hypermedia 
Publisher: ACM Press 

Full text available: pdf(167.88 KB) Additional Information: full citation , abstract, references , citings, index 
^^-^ terms 

This paper addresses the issue of Web document summarization. As textual content of 
Web documents is often scarce or irrelevant and existing summarization techniques are 
based on it, many Web pages and websites cannot be suitably summarized. We consider 
the context of a Web document by the textual content of all the documents linking to it. 
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To summarize a target Web document, a context-based summarizer has to perform a 
preprocessing task, during which it will be decided which pieces of informati ... 

Keywords: context, hyperlinks, summarization, web document 



10 Xanalogical structure, needed now more than ever: parallel documents, deep links to §|§ 

|k content, deep versioning T and deep re-use 
^ Theodor Holm Nelson 

December 1999 ACM Computing Surveys (CSUR) 

Publisher: ACM Press 

Full text available: ^ pdf(787.72 KB) Additional Information: full citation , references , citings , index terms 



11 Querying structured documents with hypertext links using OODBMS 
V. Christophides, A. Rizk 

September 1994 Proceedings of the 1994 ACM European conference on Hypermedia 
technology 

Publisher: ACM Press 

Full text available* W\ df(1 32 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

Hierarchical logical structure and hypertext links are complementary and can be combined 
to build more powerful document management systems. Previous work exploits this 
complementarity for building better document processors, browsers and editing tools, but 
not for building sophisticated querying mechanisms. Querying in hypertext has been a 
requirement since [19] and has already been elaborated in many hypertext systems, but 
has not yet been used for hypertext systems superimposed on an und ... 

Keywords: hypertexts, information retrieval, object oriented databases, path 
expressions, query languages, structured documents 



12 Document Analysis and Retrieval: Latent semantic linking over homogeneous 
repositories 

^ Alessandra Alaniz Macedo, Maria da Graga Campos Pimentel, Jose Antonio Camacho Guerrero 
November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 

Publisher: ACM Press 

Full text available ^pdf(305.72 KB) Additional Information: full citation , abstract, references , citings, index 

terms 

We present a framework for the automatic generation of links based on salient semantic 
structures extracted from homogeneous web repositories, and discuss an imple-mentation 
of the framework. For this study, we consider homogeneous the repositories of the 
eClass, an instrumented environment that automatically captures details of a lecture and 
provides effective multimedia-enhanced web-based in-terfaces for users to review the 
lecture, and the CoWeb, a web-based service for collaborative authorin ... 

Keywords: automatic linking, information integration, information retrieval, semantic 
structures 



13 Link Analysis: Improvement of HITS-based algorithms on web documents 

Longzhuang Li, Yi Shang, Wei Zhang 
▼ May 2002 Proceedings of the 11th international conference on World Wide Web 
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Publisher: ACM Press 

Full text available- t Bpdff214.35 KB) Additional Information: full citation , abstract, references , citings, index 
LlJ "^ terms 

In this paper, we present two ways to improve the precision of HITS-based algorithms on 
Web documents. First, by analyzing the limitations of current HITS-based algorithms, we 
propose a new weighted HITS-based method that assigns appropriate weights to in-links 
of root documents. Then, we combine content analysis with HITS-based algorithms and 
study the effects of four representative relevance scoring methods, VSM, Okapi, TLS, 
and CDR, using a set of broad topic queries. Our experi ... 

Keywords: HITS-based algorithms, information retrieval, relevance scoring methods 



14 Document detection: PR-LINK system: phase I summary 
Elizabeth D. Liddy, Sung H. Myaeng 

September 1993 Proceedings of a workshop on held at Fredericksburg, Virginia: 
September 19-23, 1993 

Publisher: Association for Computational Linguistics 

Full text available: ^ pdf(1.66 MB) Additional Information: full citation , abstract , references 

The underlying principle of the DR-LINK System is that retrieval must be at the 
conceptual level, not the word level. That is, a successful retrieval system must retrieve 
on the basis of what people mean in their query, not just what they say in their query. 
The same is true of documents - their representation needs to capture the content at the 
conceptual level of expression. To accomplish this human-like goal, DR-LINK aims to 
represent and match documents and queries at all of the avail ... 

15 Linking object oriented database and hypertext to support software documentation 

❖ Bing Wang, Peter Hitchcock 
November 1992 Proceedings of the 10th annual international conference on Systems 

documentation 
Publisher: ACM Press 

Full text available: ^| pdf(733.00 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes using a general hypertext-based system Intersect to support the 
definition and manipulation of software documentation. Object oriented database is used 
as a basis for supporting both the application modelling and hypertext. There are two 
main advantages, firstly, Intersect supports both application modelling and corresponding 
hypertext structure dynamically; Secondly, Intersect supports versions of any complex 
object ty ... 

16 Consistent framework documentation with computed links and framework contracts 
& Serge Demeyer, Koen De Hondt, Patrick Steyaert 

^ March 2000 ACM Computing Surveys (CSUR) 

Publisher: ACM Press 

Full text available: ^ pdf(83.03 KB) Additional Information: full citation , references , citings , index terms 



17 Text summarization: Multiple & single document summarization using DR-LINK 
Mary McKenna, Elizabeth Liddy 

October 1998 Proceedings of a workshop on held at Baltimore, Maryland: October 13 
15, 1998 

Publisher: Association for Computational Linguistics 

Full text available: pdf(607.29 KB) Additional Information: full citation , abstract , references 
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Our Tipster Phase III research objective for the Summarization task is to produce a single 
summary across multiple documents returned from a search on an information retrieval 
system. An established set of metrics to evaluate the performance of our system is not 
available in this field at present, so this research is also developing a procedure to 
evaluate the summaries we create. We hope to uncover useful metrics and evaluation 
variables that can be used by others working in this area. 



18 Multimedia authoring: Linking multimedia presentations with their symbolic source 
^ documents: algorithm and applications 
^ Berna Erol, Jonathan J. Hull, Dar-Shyang Lee 

November 2003 Proceedings of the eleventh ACM international conference on 
Multimedia 

Publisher: ACM Press 

Full text available: |£ ] pdf(472.17 KB) Additional Information: full citation , abstract , references , index terms 

An algorithm is presented that automatically matches images of presentation slides to the 
symbolic source file (e.g., PowerPoint™ or Acrobat™) from which they were generated. 
The images are captured either by tapping the video output from a laptop connected to a 
projector or by taking a picture of what's displayed on the screen in a conference room. 
The matching algorithm extracts features from the image data, including OCR output, 
edges, projection profiles, and layout and determine ... 

Keywords: document linking, e-learning, meeting recording, multimedia meeting room, 
presentation recording, synchronization 



19 Document creation II: Page composition using PPML as a link-editing script 
Steven R. Bagley, David F. Brailsford 

October 2004 Proceedings of the 2004 ACM symposium on Document engineering 
Publisher: ACM Press 

Full text available: ^] pdf(197,33 KB) Additional Information: full citation , abstract , references , index terms 

The advantages of a COG (Component Object Graphic) approach to the composition of 
PDF pages have been set out in a previous paper [1]. However if pages are to be 
composed in this way then the individual graphic objects must have known bounding 
boxes and must be correctly placed on the page in a process that resembles the link 
editing of a multi-module computer program. Ideally the linker should be able to utilize al 
declared resource information attached to each COG. 

We have investiga ... 



Keywords: PDF, PPML, form Xobjects, graphic objects, link editing 



20 Re-linking Multi-Page Web Documents 
Jim Weirich 

May 1997 Linux Journal 

Publisher: Specialized Systems Consultants, Inc. 

Full text available: W\ html(9.42 KB) Additional Information: full citation , index terms 



Results 1 - 20 of 200 Result page: 1 2345678910 next 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2006 ACM, Inc. 
Terms of Usage Privacy Policy Code of Ethics Contact Us 



http://portal.acm.org/results.cfm?coll=GUIDE&dl=GUIDE&C^ 3/30/06 



Results (page 1): linked documents 



Page 7 of 7 



Useful downloads: HI Adobe Acrobat %k QuickTime ti l Windows Media Player Real Player 



http://portal.acm.org/results.cfm?coll=GUIDE&dl=GUIDE&CFID=68432352&CFTOKEN=... 3/30/06 



